Chunking German: An Unsolved Problem
نویسندگان
چکیده
This paper describes a CoNLL-style chunk representation for the Tübingen Treebank of Written German, which assumes a flat chunk structure so that each word belongs to at most one chunk. For German, such a chunk definition causes problems in cases of complex prenominal modification. We introduce a flat annotation that can handle these structures via a stranded noun chunk.
منابع مشابه
Alignment-Guided Chunking
We introduce an adaptable monolingual chunking approach–AlignmentGuided Chunking (AGC)–which makes use of knowledge of word alignments acquired from bilingual corpora. Our approach is motivated by the observation that a sentence should be chunked differently depending the foreseen end-tasks. For example, given the different requirements of translation into (say) French and German, it is inappro...
متن کاملUsing Constraint Grammar for Chunking
This paper presents and evaluates a novel and flexible chunking method using Constraint Grammar (CG) rules to introduce chunk edges in corpus annotation. Our method exploits preexisting (non-constituent) morphosyntactic annotation such as part-of-speech or function tags, but can also be made to work on raw text, integrated with other CG modules. The first version of the chunker was developed fo...
متن کاملImpact of Prosodic Position on Vocalic Space in German and French
The relation between prosodic constituents of different levels (the prosodic hierarchy) and the realization of phonemes is investigated in French and German broadcast speech. Four levels of prosodic categories are defined and the boundaries are automatically derived using syntactic chunking and forced alignment. We show that the higher a vowel position within the prosodic structure, the more st...
متن کاملA High-Performance Semi-Supervised Learning Method for Text Chunking
In machine learning, whether one can build a more accurate classifier by using unlabeled data (semi-supervised learning) is an important issue. Although a number of semi-supervised methods have been proposed, their effectiveness on NLP tasks is not always clear. This paper presents a novel semi-supervised method that employs a learning paradigm which we call structural learning. The idea is to ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010